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Abstract. Consider a system F of n polynomial equations in n unknowns, over an alge- 
braically closed field of arbitrary characteristic. We present a fast method to find a point in 
every irreducible component of the zero set Z of F. Our techniques allow us to sharpen and 
lower prior complexity bounds for this problem by fully taking into account the monomial 
term structure. As a corollary of our development we also obtain new explicit formulae for 
the exact number of isolated roots of F and the intersection multiplicity of the positive- 
dimensional part of Z. Finally, we present a combinatorial construction of non-degenerate 
polynomial systems, with specified monomial term structure and maximally many isolated 
roots, which may be of independent interest. 



1. Introduction 



The rebirth of resultants, especially through the toricQ resultant ||GKZ94|| , has begun to 



provide a much needed alternative to Grobner basis methods for solving polynomial systems. 
Continuing this philosophy, we will use toric geometry to derive significant speed-ups and 
extensions of resultant-based methods for solving polynomial systems with infinitely many 
roots. 

The importance of dealing with degenerate polynomial systems has been observed in earlier 
work on quantifier elimination over algebraically closed fields ||CG84| , [Ren87| , |Gan88|| : Many 
reasonable algorithms for polynomial system solving fail catastrophically when presented 
with a system F (of n polynomials in n unknowns) having a positive-dimensional zero set 
Z. Even worse, this kind of failure can also occur when F has only finitely many roots, if F 
has infinitely many roots "at infinity." When such failures occur, it is of considerable benefit 
to the user to at least be given some sort of description of the zero-dimensional part of Z. 

We will present two new techniques for handling such degeneracies. The twisted Chow 
form (cf. Main Theorem ^ allows one to quickly coordinatize many (but not all) degenerate 
Z, simply by injecting some extra combinatorics into the classical w-resultant. Our second 
technique builds on the twisted Chow form and works for all degenerate Z: The toric 
perturbation (cf. Main Theorem ^) refines and generalizes an earlier algebraic perturbation 
trick used by Chistov and Grigoriev ||CG84|] , Renegar |[Ren87|| , and Canny ||Can90| . 
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^ Other commonly used prefixes for this modern generalization of the classical resultant |Van50| include: 
sparse, mixed, sparse mixed, A-, {Ai, . . . ,Ak)-, and Newton. 
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Our refinement takes sparsity into account and allows one to replace the polynomial de- 
grees present in earlier complexity bounds by more intrinsic geometric parameters (cf. Main 
Theorems and We will see in sections |3.4| and ^ that our bounds are a definite improve- 
ment, sometimes even by a factor exponential in n. Our framework also allows us to work 
over any algebraically closed field (as opposed to some earlier restrictions to the complex 
numbers) and to isolate the zero-dimensional part of Z. 

We also derive four corollaries which may be of independent interest: 

1. An explicit method to compute field extensions involving the roots of F (Corollary |l|). 

2. An exphcit formula for the exact, as opposed to generic, number of isolated]^ roots of 
F (Corollaries ^ and ^. 

3. A combinatorial construction, within polynomial time for fixed tt,, of explicit F with 
specified monomial term structure and no roots "at infinity" (Main Theorem 0). 

4. A lower bound (conjecturally an explicit formula) for the intersection multiplicity of 
the positive-dimensional part of Z (Corollary |^). 

Our main results are stated precisely in section |^. We then give several simple examples of 
our main results in section |^. There we also give an intuitive discussion of roots "at infinity" 
and show how our results include Canny's earlier generalized characteristic polyno- 
mial (GCP) as a special case. Section ^ then details our aforementioned combinatorial 
construction of "generic" F with specified monomial term structure. Our main results are 
then proved in section |^, and we discuss the computational complexity of our techniques in 
section ||. 

2. Summary of Main Results 

Before describing our results in detail, we will introduce some necessary notation: In what 
follows, we will let F:= (/i, . . . , /„+i), where for all i, fi{x) = Ylia(^Ei ^i^aX""^ Ei is a nonempty 
finite subset of (NU {O})", and x° is understood to be the monomial term y. Given 

the Ci^a, we will be solving for x := (xi, . . . , x„). So the (n + l)-tuple E := {Ei, . . . , E^+i) 
thus controls which monomial terms are allowed to appear in our systems of equations. 
An accepted shorthand is to say that is an {n -\- 1) x n polynomial system with 
support contained in E. (This generalizes in an obvious way to k x n systems.) 

Of course, our given polynomial systems will usually be nxn, so we will let F:={fi, . . . , /„) 
and E := {Ei, . . . ,En). We also let Conv(_B) denote the convex hull of (i.e., smallest con- 
vex set containing) a point set B C M", and let [k] := {1, ... ,k} for any positive integer 
k. An important geometric invariant for nxn systems of equations is A^{E) — the 
mixed volume [ [BZ88| , |Sch94| , |GK94| , [EU95| , |Ewa96| , pGH98|| of the convex hulls of the 



Ei. For {n + 1) X n systems, we also have the following two important complexity-theoretic 



aye N 
E n 



parameters: R{E) := Y^Z^ M{Ex, . . . , ^i-i, ^i+i, . . . , ^n+i) and S{E) = 0{y/ne'^M 
where A^^^ is the average value of A4{S) as £ ranges over all n-tuples {£i, . . . ,Sn) with 
Sj G {El, . . . , En+i} for all j G [n]. The true definition of S{E) depends on the efficiency of 
a particular class of algorithms described later in sections |3.2| , |5.1| , and ^ 

^By an isolated root, we will simply mean a root not lying in a positive-dimensional component of Z. 
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We will usually take all polynomial coefficients to be constants in a fixed algebraically 
closed field IK or polynomials in K[s] for some new parameter s. Also, we let IK* :=]K\{0} 
and A := Conv({0,ei, . . . ,e„}), where O G M" denotes the origin and Cj G is the i— 
standard basis vector. Finally, using # for set cardinality, let (pA- (K*)" — > ^t^~^ be the 
rational map defined by x i-^ [x" | a G A] . On occasion, we will extend the domain of ipA to 
a suitable toric variety (cf. section 

2.1. Finding Points in All Components in Intrinsic Polynomial Time. 

Our first main result allows us to efficiently use exact arithmetic to find a point in every 
irreducible component of Z. In what follows, 0*{T) means 0{T log^ T) for some constant 
r >0. 

Main Theorem 1. Let F be an n x n polynomial system with support contained in E, 
assume M{E) >0, and set E^i+i = ^4 = AflZ". Also let '^a^Z) he the zero se^ of F in P|^. 
Then we can find univariate polynomials h,hi, . . . ,hn with the following properties: 

0. The degrees of h and hi, . . . ,hn are all bounded above by /iA{E). 

1. For any root 9 of h, define (9) := {hi{9), .. . ,hn{9)). Then -f{9) e(K*f ^ -f{9) is a 
root of F. 

2. There is at least one 7(6*) in every irreducible component o/y9^(2)n(K*)"'. In particular, 
the set of points {7(6')}/j(e)=o contains all the isolated roots of F in (IK*)*^. 

3. Let K be Q(cj^a | i G [n],a G Ei) or (Z/pZ)(cj^a | i G [n\,a G Ei), according as charK 
is zero or a prime p. Then all the coefficients of h, hi, . . . ,hn (and all intermediate 
calculations thereof) are in K, or a degree [21ogp((?7, + 1)M.{E))'\ algebraic extension 
of K, according as charIK is zero or p. 

Furthermore, we can find h,hi, . . . ,hn deterministically within 0*{n'^Ai{E)^R{EyS{EY'^'^^) 
arithmetic steps and 0{nS{EY) space. Finally, at the expense of replacing E by O U E := 
({O} U El, . . . , {O} U En), we can ensure that {'y(9)}h{e)=o includes all the isolated roots of 
F in K" as well. 



Remark 1. The above time bound can be reinterpreted as "near-heptic in the number of 
roots of a system closely related to F. " Also, depending on the combinatorial data E and the 
algebraic data charIK, the above complexity bounds can be lowered considerably, especially if 
randomization is allowed. These improvements are detailed further in section In particu- 
lar. Main Theorem already improves an earlier intrinsic complexity bound due to Giusti, 
Heintz, Morais, and Pardo ||GHMF95|| 



Remark 2. The assumption that M.[E) >0 can actually be checked in polynomial time, via 
lemma |I] of section 0. Furthermore, if M.{E) = {], then we can simply add <n appropriately 
chosen points to E (within the same asymptotic time bound) to make M.{E) positive. 



^Zero sets in projective s pace (and m ore general toric varieties) are defined in section ^. 

*It should be noted that | |GHMP9"5 | also deals with the more general problem of complexity bounds for 



polynomial system solving in terms of arithmetic networks and straight-line programs. 
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Our first main theorem thus removes a geometric/complexity-theoretic bottleneck from 
solving polynomial systems: For example, fast algorithms for finding approximations within 
£ > of all the roots of F in (C*)", within time (9*(12'^A^(i?)^ log log (neglecting some 
preprocessing), have recently been announced by Mourrain and Pan ||MP98|| . However, their 



algorithm assumes that Z is zero-dimensional and ]K = C. On the other hand, the algorithms 
of Canny from ||Can88| , |Can90|| can handle^ positive-dimensional but they assume K = C 



and result in a Las Vegas complexity bound of O* {ndji\^^ ^ ), where dji and d^. are 

respectively the product and sum of the total degrees of the /j. We will see in sections ^ 
and 1^ that our algorithm above is at least this fast, and is in fact frequently much faster. 
We also point out that when Z is positive-dimensional, Grobner basis techniques for solving 
F suffer from a worst-case arithmetic complexity doubly exponential in n 



Main Theorem |I| is also useful for certain rationality questions via the following corollary, 
proved in section [5 .21. 



Corollary 1. Following the notation of Main Theorem suppose now that charK = and 
F has only finitely many roots in (K*)". Let g be the greatest common divisor of h and 
nr=i Then K{(i \ {(i, ... , Cn) £ (K*)" is a root of F) is exactly the splitting field of g. 

To make Main Theorem |l| more precise, we now outline its underlying toric geometric 
techniques. 

2.2. Main Geometric Results. 

First recall that there is a natural addition of point sets in defined hj B + B' := {b+b' \ 6g 
B,b' G B'}. In the notation of [ |Roj97a| , [Roj97b|| , we can associate to any {n + l)-tuple of 



point sets in Z", E, a toric resultant Res£;(F). This important operator is amply detailed 
Stu93| , |Stu94| , |GKZ94| , [EC95| , |Stu98|| , so let us state our first geometric construction. 



m 



Definition 1. Lei P := ^"^^ Conv(Ei) and P := P + Conv(E„+i). Also let Ac Z"^ be any 
finite subset with at least two points and define /n+i(x) := XlaeA ""a-^" '^'^^ (""a I (^^^), 
where the Ua are new parameters. We then call Chow^(ii) := Res(£;^^)(P, a twisted 
Chow form of F . (Frequently, we will set En+i = A and thus P = P + Conv(y4) as well.) 

Note that Chow^(M) will be a polynomial in the parameters Ua, encoding (in a manner to be 
described below) the roots of F. Twisted Chow forms are a generalization of the classical u- 
resultant [ |Van50|| since the latter simply corresponds to the case where we use the classical 
"dense" resultant and let A = Ar\ Z". For convenience, we will frequently respectively write 
uo and Ui in place of uq and Wg,. 

Example 1. Suppose we take charK ^ {2, 3}, n = 2, Ei = E2 = 2A f] , A = An Z^ and 
F = {1 + 2y — x"^ + y"^,! + 2x + x"^ — 4?/^). Then Chow^ is simply the u-resultant, and this 
polynomial in Uo,Ui,U2 factors (modulo a nonzero constant multiple) as {uq + |mi — 1^2) x 
{uq + 3ui + 2u2){uo — ui)^. It is also not hard to see that F has exactly three roots: (|, — |), 
(3,2), and (—1,0); the last occuring with multiplicity 2. Better still, we can read this off 



^That is, construct h,hi, . . . ,hn as in Main Theorem 1. 
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directly from our u- resultant by computing ( eocffidcnt of coefficient of IV ^^^^ linear factor 
(with uq appearing) of the u-resultant. (See Main Theorem^ below.) 

Our next main theorem tells us exactly how and when we can use a twisted Chow form to 
compute monomials in the roots of F. Recall that to any n-dimensional rational polytope 
QcM" one can associate its corresponding toric variety (over K) T{Q) ||KKMS73| , Pan78| , 
|KSZ92| , [Ful93| , |GKZ94| , |Roj99|| , and this T{Q) always has0 a naturally embedded copy of 



(K*)". To state our results fully, we will require some toric variety terminology, but the 
underlying idea is simple: By working in compactifications more general than the projective 
spaces {PkI^i, we can make better use of the monomial term structure of our polynomial 
systems. 

Main Theorem 2. Following the notation of definition^, set En+i =A and let Z denote 
the zero set of F in T{P). Then ChowA(M) is a homogeneous polynomial, either identically 
zero or of degree M.{E), with the following properties: 

1. The polynomial Chow^ is indentically zero <^=^ ipA{Z) is positive- dimensional. 

2. If(ET{P) is a root of F then Chow a is divisible byJ^aeA'^aUa, where [ja \ ct^^] = V^a(C)- 

3. The polynomial Chow a{u) splits completely (overK.) into linear factors. In particular, 
if Chow A^O and a nonzero linear form J^aeA'^aUa divides Chow^, then [ja \ aEA] = 
V^yi(C) /or some root ( &T(P) of F . 

The zero set of F in a toric variety is formalized in section ^. Note in particular that 
assertions (2) and (3) tell us that calculating ChowA('u) allows us to reduce the computation 
of the projective coordinates [C" | a G A], for any root C £ ^(-P) of F) to a multivariate 
factorization problem. Of course, this reduction only works if Chow a{u) is not indentically 
zero, and assertion (1) tells us exactly when this happens. 
We also obtain the following almost immediate corollary. 

Corollary 2. Following the notation of Main Theorem we may check if Chow^i is iden- 
tically zero (and thus whether dim (Pa{2)>0) within 0*{n'^M{E)R{E)S{E)'^-^'^^) arithmetic 
steps and 0{nS{E)'^) space.^ Furthermore, if Chow^(-u) does not vanish identically, then 
we can compute the exact number of roots of F in (K*)", counting multiplicities, within 
0*{n^M{EfR{E)S{EY-^'^^) arithmetic steps andO{nS{Ef) space.'^ 

Even better, by combining with corollary ^ of section ^ we can also see how many roots lie 
at various parts of "toric infinity." Corollary ^ thus generalizes Bernshtein's famous mixed 
volume bound | [Ber75| | to exact root counting over an algebraically closed field. 



However, there is still another improvement to be made: It is actually possible for F to 
have infinitely many roots in T(P) but only finitely many roots in (K*)". In such cases. 



^It is not always the case that T((3) also has a naturally embedded copy of K". However, with some 
extra work, one can modify Q so that this is true. 

^Just as in Main Theorem 1, these complexity bounds can be significantly lowered under certain reasonable 
assumptions. Also, unless otherwise stated, arithmetic steps will always be counted over the finite extension 
of K described in Main Theorem |^. 
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sometimes the right A will permit an exact count of the roots of F in (K*)" via Corollary 
El. For example, it is easy to construct F, A, and A' where Chow^ vanishes identically but 



ChoWyi/ does not (cf. section |3.3| ). On the other hand, those F with infinitely many roots in 
(K*)" will never have a nontrivial twisted Chow form. 

Our next construction works for all F and A, and begins as follows: 

Definition 2. Following the notation of Main Theorem^, assume further that Ai{E)>0. 
Let F* be anynxn system with constant coefficients and support contained in E, such that F* 
has only finitely many roots in T'(P). We then say that 7i{u; s) ■.= Res(^E,A){F — sF*, /„+i) 
(where s is a new indeterminate) is a toric generalized characteristic polynomial for 
{F,A). Furthermore, we define Pert a,f* (u) & K[ua \ a&A] to be the coefficient of the term 
of T-C{u] s) of lowest degree in s. We call Pert^.F* « toric perturbation of (F,A) and, 
when no confusion is possible, we will sometimes write Pert^ instead. 

The polynomial Pert^i is what we can use in place of Chow^ when Chow^i vanishes identi- 
cally. We will describe this shortly, but first we digress momentarily to describe how to con- 
struct the necessary "generic" F* above: If we simply fix the support of F* to be E, and pick 
random numbers for the coefficients (using any probability distribution on ]K;#monomiai terms 
yielding probability 1 avoidance of algebraic hypersurfaces), lemma ^ of section | tells us 
that F* will satisfy the above hypothesis with probability 1. Alternatively, a deterministic 
method for constructing suitable F* is the following. 



Definition 3. ||Ro.i94| , |RW96|| Given n-tuples D := (Di, ... , Dn) and E := (^i, . . . , E^) of 



nonempty compact subsets ofM."", we say that D fills E (or D is a fill of E) iff (0) DiCEi 
for all i e [n] and (1) Ai{D) = Ai{E). We then call D irreducible iff the removal of any 
point of D causes M.{D) to decrease. 



Main Theorem 3. Following the notation of definition^, suppose EiC'Z'^ for alii, Ai{E) > 
0, and D is an irreducible fill of E. Then, for any choice o/ nonzero Cj.a gK*, the polynomial 
system (XlaeDi ^i.a^;", • • • , X]aeD„ (^n,aX"') has exactly A4{E) roots, counting multiplicities, in 
(K*)" and no roots in T(P)\(K*)". Furthermore, letting m:=Y^^=iH^Ei, an irreducible fill 
of E can be found within time 0{nm?"^^^'^^^), for some absolute constant OO. 

Some simple examples of fills appear in section |3.1| and we present further background on 
filling in section |. We emphasize that while it is much more practical to pick a generic F* 
via randomization, the cost of derandomizing via fills can sometimes be amortized when one 
solves many F with similar monomial term structure. In particular, the selection of an F* 
need only be done once for a given n-tuple E, regardless of the coefficients of F. 
Toric perturbations improve on twisted Chow forms as follows: 

Main Theorem 4. Following the notation of definition 2, Pert^('u) is a nonzero homoge- 
neous polynomial of degree M.{E) with the following properties: 

1. ChowA ^ <^==^ '^(■s) has a nonzero constant term. Also, when the latter holds, 
ChowA = Pertyi . 
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2. If ( E T{P) is an isolated root of F then Pert^ is divisible by YlaeA'^aUa, where 
[-fa I aeA] = ^A{C)- 

3. The polynomial Pert a{u) splits completely (over K.) into linear factors. In particular, 
extending the correspondence of assertion (2), for every irreducible positive- dimensional 
component W of Z, there is at least one factor of Veil a corresponding to a root C^^W . 

Furthermore, we may evaluate Pertyi at any point in K*"^ within 0*{nR{E)'^S{E)^-^'^^) arith- 
metic steps overK. and 0{nS{EY) space^ 

We emphasize tliat tlie main advantage of Pert^ is that we can pick any A we prefer and 
still get a useful analogue of Chow^. For instance, even if the w-resultant unluckily vanishes 
identically, we can always simply set ^4 = A fl Z*^ and directly read off the coordinates of the 
isolated roots of F from the factors of Pert^(M) (assuming one can do multivariate factoring 
over K). Indeed, PertAnZ" and assertion (3) are central to our explicit construction of points 
in every irreducible component^ of not to mention the proof of Main Theorem |l]. 

Better still, we can sometimes (conjecturally always) distinguish which roots of F are 
isolated. 

Corollary 3. Following the notation above, let Zq and Z^ respectively denote the zero- 
dimensional and positive- dimensional parts of Z. Then Z^ fl (K*)" = =^ we can count 
the number of points in Zq fl (K*)", with or without multiplicity, within the same asymptotic 
complexity bounds as stated in Main Theorem More generally, there is a randomized 
algorithm which computes upper bounds on the cycle class degrees deg^o ond deg2on(K*)", 
and a lower bound on deg^oo; within the same complexity bounds. Conjecturally, these 
bounds are all actually explicit formulae with probability 1. 

A simple example of this final main result (and Main Theorem ^ also appears in section 
p.2| . So in summary, as the zero set of F in T(P) becomes more and more degenerate, we 
can successively use Corollaries || and ^ to count roots in (K*)" with complete generality. 
We also point out that a special case of Corollary |^ was used in ||Roj98a|| in connection with 
a fast general algorithm for exact multivariate root counting in (M*)". 

We can also construct the corresponding analogues of h and the hi to describe Zq explicitly, 
but this becomes more technical (cf. section |5.7| ). The same can be said for the analogous 
results in K", and this is covered in greater depth in |[Roj97b|| and [[Roj98c|| . We thus obtain 



a first step toward an algorithmic foundation for excess intersections. (See |l:''ul84| ] for a brief 
historical description of this problem.) 
We now illustrate our results and theory. 

3. Examples 

We begin with two small examples of filling. We will then see applications of the toric 
GCP and twisted Chow form to some degenerate 2x2 and 3x3 polynomial systems. Finally, 

* Just as in Corollary || and Main Theorem 1 , these complexity bounds can also be significantly lowered 
under certain reasonable assumptions. 

^The analogue of assertion (3) had been conjectured for Canny's GCP. We have thus proved this conjecture 
and generalized it to the toric GCP. 
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we will see a brief comparison of the toric GCP to the original GCP. In what follows, we will 
sometimes respectively write x, y, and z in place of xi, X2, and X3. 

3.1. Filling Squares and Cubes. 

For our first example, consider the pair of rectangles P:= ([0, a]x [0, b], [0, c]x [0, d\) where a, 
b, c, and d are positive integers. Then it is easily verified (via theorem ^ of section ^ that 
the pair D = ({(0, 0), (a, b)}, {(0, d), (c, 0)}) fills V. In this case, the mixed area of both pairs 
is easily checked to be ad + be. Note also that D is a pair of oppositely slanting diagonals of 
our initial pair of rectangles (modulo taking convex hulls). Finally, it is easily checked that 
D is indeed irreducible, since the removal of any point of D results in a mixed area of 0. 

By Main Theorem ^, we thus obtain that for any ai, 02, A, /92 GlK*, the bivariate polyno- 
mial system {ai + a2x'^y^ , Pix"" + (3211^) will have exactly ad + bc roots, counting multiplicities, 
in (K*)2. 

For our second example, let V instead be a triple of standard cubes (so that the vertex 
set of each cube is simply {0, 1}^). Then, using the criterion from theorem ^ once again, it 
is easily verified that the triple D = ({(1, 0, 0), (0, 1, 0), (0, 0, 1)}, {(1, 1, 0), (1, 0, 1), (0, 1, 1)}, 
{(0, 0, 0), (1, 1, 1)}) fills V. (This is depicted in Figure 1 below.) Also, it is easily checked 
that the mixed volume of both triples is 6. Finally, note that this D is irreducible as well 
by theorem ^. Alternatively, one can easily check this by brute force, using any one of the 
publically web-accessible software packages for mixed volume computation by Emiris, Gao, 
Huber, or Verschelde. 











\v- 






— 4 





Figure 1. An irreducible fill of three 3-cubes. 



By Main Theorem |^, we thus obtain that for any ai, 02, /?2, /^s, 7i, 72, 73 ^ K*, the 
trivariate polynomial system {a\ + a2xyz^ [3ix + /?2Z/ + P^z, lixy + 72x2; -|- -^yz) will have 
exactly 6 roots, counting multiplicities, in (K*)^. 

In summary, theorem ^ of section ^ gives a necessary and sufficient criterion for D to fill 
a given n-tuple E, and Main Theorem ^ tells us that we can construct some irreducible fill 
for E within time singly exponential in n. 

3.2. Pert^ Applied to a Degenerate 2x2 System. 

Consider the bivariate polynomial system 

F={1 + 2x — 2x'^y — 5xy + + 3x^y, 2 + 6x — Qx^y — llxy + + bx^y) 

over any field of characteristic not equal to 2, 3, or 7. Letting E be the support of F, 
the reader can easily verify that M.{E) = 4, and that the only roots of F are the points 
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{(1, 1), (i, I)} and the line {-1} x K.0 So it would appear that the u-resultant (and even 
ChowAnzO will vanish identically and not give us any useful information about any of these 
roots. Let us see how we can use Pert^ (with ^4 = A fl Z^) to recover everything we need to 

know about the roots of F. 

First, via combinatorial means |[Stu93| , |li)C95|| , we construct a toric resultant matrix, 
M^- This matrix has the property that its determinant is a multiple of the toric resultant 
defining the toric GCP (the precursor to Pert a)- With the assistance of a Matlab program, 
res2 .m (publically available from the author's web-page), we can obtain the following 17 x 17 
matrix: 



Me = 



Ul 























U2 























Mo 


Uo 

























U2 





























U2 

















Ul 




















Mo 














U2 
















Ul 





























63 


64 


65 











62 

















bo 


bi 














63 


bi 


65 








bi 


62 

















bo 

















63 


64 


65 





bo 


bi 








62 





























63 


64 


65 





bo 








bi 


b2 























as 


04 


05 





oo 


Ol 








02 





























03 


04 


05 





00 








Ol 


02 



































04 


05 


00 


Ol 











a3 


02 


0-2 























a3 


04 





Oo 


05 











Ol 


b2 























63 


64 





60 


65 











bi 


bi 


62 























63 








64 


65 








bo 








a3 


04 


05 











a2 

















Oo 


Ol 














a3 


04 


05 








Oi 


a2 

















Oo 





ai 


a2 























a3 








04 


05 








Oo 



where the (resp. bi) are indeterminates correponding to the coefficients of /i (resp. /2). 
Note in particular that R{E) = 4 + 4 + 4 = 12. As for the other complexity parameter S{E), 
its true definition is the size of any available toric resultant matrix. So S{E) = 17 in the case 
at hand. 

Now note that by theorem |, D := ({O, (3, 1)}, {(1, 1), (2, 0)}) is an irreducible fill of E. 
So by Main Theorem ^, we can take F* = {1 + x^y, xy + x^) and apply Main Theorem ^ 
to construct the toric GCP, H{u; s). By setting (oq, ... ,05) = (1 — s, 2, —2, —5, 1, 3 — s), 
(60, • • • , ^5) = (2, 6, —6, —11 — s, 4 — s, 5), and taking the determinant of M^, we then obtain 
a nonzero constant multiple of Ti.{uo, Ui, U2; s). 

However, multivariate symbolic expansions are typically slow and memory-intensive. So 
to efficiently "solve" F — that is, to quickly find a point in every irreducible component 
of its zero set — we will instead compute the univariate polynomials h,hi,h2 from Main 
Theorem 1 via interpolation. The polynomial h is derived simply by specializing Pertyi at 
some suitable value of {ui,U2) and then interpolating through 1 + A4{E) values of uq. The 
derivation of hi and h2 is essentially the same but involves an additional intermediate step 



^°For n = 2, there is the simple formula M{E) = Area(Conv(i?i + E2)) — Area(Conv(i?i)) — Area(Conv(i?2)). 
Also, both polynomials are divisible by x + 1. Furthermore, when charK = 2, the second isolated root 
becomes an isolated root lying on the a;-axis. 
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described in section |5]l|. Since Pert^ is in turn a coefficient of T-C{u] s), there is also another 
level of interpolation through 1 + S{E) — Ai{E) values of s. 

For example, setting (^1,^2) = (|, 1) (and setting uq equal to a parameter t), we easily 
obtain via Maple that 

h{t) = -153 + 120t + 1540t2 + 1600t^ + 448t^ 

, , , 11762 19150 114736 2 7264 . 

hi(t) = \ 1 H r H r 

^ ^ 7511 22533 22533 3219 

, , , 5881 32108 57368 2 3632 , 

h2(t) = \ 1 H r H f. 

^ ' 7511 22533 22533 3219 

Since h{t) factors as {2t + l)(2t + 3)(4t — l)(28t + 51), we thus immediately obtain from 
Corollary |I| (and the fact that ui and U2 were chosen within K) that the zero-dimensional part 
of 2 n (K*)^ actually lies in {K*Y, where K is the quotient field generated by the canonical 
image of Z in IK. Furthermore, by Main Theorem 1 (and the fact that O G £^1 H £^2); we 
can at last recover a set of points lying in Z (including all the isolated roots of F in K^) by 
substituting { — f,!,— f|} into the pair {hi(t), h2{t)). 

As for the full expansion of the toric GCP, we can easily compute via Maple that ?^(m; s) 
is, up to a nonzero constant multiple, 

(U2 — Uq + U-i + bUiU2 — 4UiU2 ~ 4UiU2)S 

+ (36ltill2 — 20u2lto — 20lt2?iO — iuiUQ — Idug — 24M2 + &1H)U\U2 

3 4 2 2222 2 3 37 

+36wiU2 + 36ui — 12MoUiti2 — 'du^UQ + 3?i2?to + 36mouiM2 — 4uoMi — 84iii?i2)s 

+(220^2 - 170ii2iio - 394u?U2 - 98iiiUo - 98iioUi«2 - 20uo + 370u2Mo 
+14itoiiiJi2 — llOuou? — 226tiiiio — 354iiiii2 + 454ui — 274uoUiU2 + 74itiit2)s^ 
+(1008u2?io - 1612MoitiU2 + 903?io - 624iiiiio - 2632?i2?io - 2104uo lll?i2 — 970U2 
-lOlOiima + 418u?U2 - 2104Mo^iiii2 " &^1u{ui ~ 1547^1^0 - 936uou? - \^^lu%u% + 2204ut)s^ 

+(538uoiiiU2 + 1271^^0 + 12253ii2?io + 6972m2m2 + 1929ui - ?,m^u\u\ + 654?ioiiiU2 
+50uiui] + 2156ii2 - 960ti?uo - 2290uo?i? + 132uiU2 - 5344uoUiM2 - 1142m? U2 + 8708^2^0)3" 
+(4384miMo - 24988^^110 - 1582u?u2 - 6756uo + 10884uoMiMi + 3802uiui + 15438mou?U2 
+ 1024UOM? + 8324«?uo - 12826^^1^0 + 11270mo^iM2 - 6976«^ + rWAulul - 21326u2ul - 2408u|)s^ 
+ (3436u?U2 + 3800U0M? + 7756^2^0 - 3886uiU2 + 1225^2 + 17059u2Uo - 5984m? mq 
+ 15708u2iio - 12232uoMiU2 + 5180mo - 2091^?^^ - 6828uou?M2 + 1316ui - 12700uoMiM2 - 4312uiUo)s^ 

+ (384w^tii - 1792w^it2 + 512u^w^ + 1536u^uiU2 + 1920uoWiu| - 1288^0^^ - 768ufu2 
-448it^ - 2436u^m| - 384ttouf + 1024uoWiM2 - 64uf + 260w^it| + 768tiiM^ - 196w|)s. 
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So our toric perturbation PertA,F* is just the coefficient of s or in this polynomial, ac- 
cording as charK7^2 or charK = 2. 

Let us now examine Pert^^^* itself in detail: Factoring with Maple, we obtain that Pert^ j?* 
splits as follows: 

-4(mo + Ml + M2)(28mo + 4mi + 49^2) (mq - Mi + M2)(4mo - 4mi + ^2) 

In particular, given any factor above, the ratio of the coefficients of Ui and Mq is precisely 
the i— coordinate of some corresponding root of F. Thus the ffist two factors correspond 
precisely to the two isolated roots we already know. As for the last two factors, note that 
they both give isolated points lying on the aforementioned line {— IjxIK. We can then guess 
that this line should be assigned an excess intersection multiplicity of 2. Of course, we might 
not know at the outset which of these roots is isolated, i.e., a zero-dimensional component 
of Z. However, since the constant term of 7i(s) vanishes, assertions (1) of Main Theorems 
H and I at least tell us that Z is indeed positive-dimensional. 

To distinguish the isolated roots, let us employ an algorithm from the proof of Corollary 
0: Apply Main Theorem ^ once more to pick F** = (1 + x^y, xy + 2x^). Noting that (due to 
their second equations) F* and F** will have no roots in common in (K*)^, let us then define 
the double toric perturbation, Pert^*, to be the greatest common divisor of Pert a,f* and 

Repeating the same calculation we used for h,hi,h2, but with Pert^ instead, we obtain 
new polynomials h**, h^*, h^*. Let us compute the gcd, g**, of h** and hl*hl*. It then turns 
out that the number of isolated roots of F is at most degh** — degg** (cf. section p. 7] ). 

More exphcitly, via Maple again, we easily see that h**(t) = {2t + l)(2t + 3) and g**{t) = 1. 
So the number of isolated roots in (K*)^ is at most 2, and the positive-dimensional part of Z 
(the line { — 1} x K) should be assigned an intersection multiplicity of at least M.{E) — 2 = 2. 
Fortuitously (conjecturally always), our lower bound is actually an equality. 

For completeness, we now reveal Veii a,f** (up to a constant multiple): 

{Uq + Ml + U2)(28Uo + 4Ui + 49U2)(mo - Ui + ^ U2){Uq - Ui - ^ U2). 

(In particular, VeiiA^p** is again the coefficient of s in T-C{s).) Note also that the last two 
factors of this toric perturbation again correspond to roots lying on the line {— IjxIK. We 
thus see that varying the coefficients of our perturbation of F has moved two of our points 
lying in Z. 

Note (via Maple again) that the original GCP could have been used above, but would 
have resulted in a variant of Pert^ of degree 16 (the product of the degrees of /i and /2) — 
four times larger than the degree of our Pert^. Also, the old GCP is significantly larger, 
having 672 terms, compared to 110 for our above toric GCP T-C{u; s). 

3.3. Which Compactification for Chow^? 

Here we show how the twisted Chow form Chow^ can vanish identically for the wrong A, 
thus giving no information about the roots of F. Along the way, we will also obtain a more 
precise visualization of the toric compacta Pj|, T(P), and T{P). We also point out that 
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while it is customary to consider the roots of F in T(P) (as in |[Ful93| , |GKZ94| , |Roj99|| ), the 



construction of Chow^ and Pert^ necessitate the consideration of roots in T(P) as well. 

To define our next example, set n = 3, A = A n Z^, and consider the 3x3 system F = 
{aiyz + a2xz + a^xy + a^xyz, hiyz + h2xz + h^xy + b^xyz, ciyz + C2xz + c^xy + c^xyz). Note 
that the mixed volume bound for this system is 1. Furthermore, it is clear that —F is a 

•J ' xyz 

linear system in {i, i, i}. So by Cramer's rule, we can express x, y, and z as ratios of 3 x 3 
determinants in the coefficients. 

Combining this with the product formula for toric resultants [ PS93|| (and clearing de- 
nominators) we obtain that Chow^ is precisely^ [423] [143] [124] uq + [123][143][124]ui + 
[123] [423] [124] U2 + [123][423][143]m3 where the bracket [ijk] [pg95|] is the 3 x 3 subde- 
terminant 



det 



ai ttj ak 
hi bj bk 

Ci Cj Ck 



of the coefficient matrix of F. This compactly expressed resultant can be thought of as a 
semi-mixed Chow form — a toric resultant of a system of n + 1 polynomials with k^n 
distinct supports. 

Now consider the specialization of F to {yz + xz + 2xy + 3xyz, yz + xz + Axy + 9xyz, 
yz + xz + 8xy + 27xyz). It is then easily verified that F has no roots in (K*)'^, but F does 
have exactly one root^ in T{P). Also, in our particular example, T{P) = Pk and, locally 
(within (K*)"), the isomorphism is given by {x,y,z) i— > [i : i : i : 1]. In particular, using the 

xyz 

latter set of coordinates, our one root of F in T{P) is exactly the point [1 : — 1 : : 0]. More 
to the point, Chow^ = for this specialization of the coefficients of F. 

A simple geometric explanation for this behavior of Chow^(-) is that the choice of A defines 
a toric variety T{A) into which the roots of F in T(P) are projected. (The variety T{A) is 
the toric variety corresponding to a point set ||GKZ94|| , and is simply the image of T{P) 
under the morphism (pA-) So depending on our choice of A, the roots of F in T{P) may 
or may not correspond to roots of F in T{A) in a well-defined way. For instance, in our 
example, F actually has infinitely many roots in T{A), so Main Theorem |^ tells us that 
Chow^ must vanish. 

So it more useful to work within T{P), since the roots of F in T{P) and T{A) are 
actually images of the roots of F in T{P). In particular, the underlying algebraic maps 
induce projections of certain faces of P (corresponding to certain parts of T(P)\(]K*)") onto 
certain faces of P and Conv(y4). Figure 2 below illustrates this, along with where the root 
[1 : — 1 : : 0] gT(P) of F "goes" within these various compacta. For instance, note that P is 
a cub octahedron, and ipA is constant on the portions of T(P)\(K*)" corresponding to the 
triangular faces with inner normals — Ci, —62, —63, and (1, 1, 1). 



^^We also need the fact that the Pedersen-Sturmfels formula, originally stated only over C, remains true 
over a general algebraically closed field (cf. section pT^ ). 

^^If charlKs {2, 3} then F will actually have infinitely many roots in T{P). So let us assume henceforth 
that char]K^{2, 3}. (It is easy to construct similar examples when charIKe {2, 3} as well.) 
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Algebraically, we have the following maps: 

r(p) 

V ^ 

p| = T(p)-^--^--^r(A)^p| 

where vr is the natural projection between compatible toric compacta (cf. section |]), and is 
the rational map (defined just on (K*)") from T{P) to T{A) obtained from x i— > [x*^ | aG A]. 
In the case at hand, the latter map is simply the identity map between the two corresponding 
naturally embedded copies of (K*)^. 

To remedy the preceding trivial Chow^, we can instead use Chow^/('u) with A' := {(0, 1, 1), 
(1, 0, 1), (1, 1, 0), (1, 1, 1)}. (This choice is motivated by trying to pick an A which is com- 
patible with P (cf. section |]).) In particular, when the coefficients of F are unspecialized, 

ai a2 a-i 04 



Chowyi/ {u) = det 



fel 

Cl 

.^(0,1,1) 



62 

C2 
^(1,0,1) 



63 

C3 
^(1,1,0) 



fe4 

C4 

"(1,1,1). 



So under our last specialization, this becomes 



12m(i_o,i) — 12u(o,i,i). Note that we now recover our root [1:— 1:0:0] from the coordinates of 
our new twisted Chow form. For example, the ratio of the x-coordinate to the y-coordinate 

X _ x^y'^z^ _ 12 _ _i 
y ~ x^y^z^ ~ -12 ~ 



is just 
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Alternatively, we can simply use Pert^i and forget about cleverly chosen A'. For example, 
by Main Theorem 3 (and theorem 1), we can simply take F* = {yz + xyz, xz + xyz, xy + xyz). 
After an application of Maple, we then obtain that PertAnz^ is exactly 5ui + 21u2- In 
particular, while the point [0:5:21:0] eT{A) does not correspond (in any obvious way) to a 
root of F in T{P), it is the image of a bona fide root of F in T(P) under the morphism ip^- 

In closing, we emphasize that in practice we would never actually compute the full mono- 
mial expansions of ChowA{u), PertA(M), or T-C{u; s) — we would instead recover the roots 
of F (or evaluate monomials thereof) via rapid and sophisticated interpolation techniques, 
e.g., [ |Can88| , |(JKL89| , |Uan90| , pK95| . In particular, this is the approach of Main Theorem 1, 



and our calculations can be sped up tremendously with suitably optimized code. 
3.4. The "Dense" Case. 

Our last example illustrates a simple fundamental case. 

Suppose E is the n-tuple (diAnZ", . . . , rf^AflZ") where diEN for all i. (So we are now 
considering the family of all nxn polynomial systems where has total degree < di for 
all i.) It is then easily verified that the system F* = {xf^, . . . , x^") (with support contained 
in E) has only finitely many roots in T{P). Indeed, in this case, T{P) = and there is 
exactly one root (of multiplicity Yl^i) origin O. Note also that our current setting is 

sufficiently simple that we could find a suitable F* with just n terms, without the need for 
an irreducible fill. 

Remark 3. Letting du '■= Yldi and c/g := Y17=i^i' ^'^ easily checked that A4{E) = du 
and R{E) = S{E) =(^'^^^^^ the dense case. The last equality follows from Macaulay's 

19^ century construction of the multivariate resultant [Pan87|| . So in the dense case, our 
complexity bound from Main Theorem 1 thus specializes to the best hounds from ||Can88|, 



|CKL89| , |Can90|| , once charK = and randomization is allowed (cf. Corollary^ of section |^. 



Letting y4 = AnZ", we then see that our polynomial T-C{u;s) is simply the original GCP 
Can90|| , but extended to a general algebraically closed field. In particular, our F — sF* is 



the polynomial system (/i — sx*, ... , /« — sx'^"). (Note also that if we set rfi = ■ ■ ■ = (i„ = 1 
then we recover the usual characteristic polynomial of a matrix). Finally, note that T{A) = 
T{P) = T{P) = PJ^ and the map ipA is the identity. So by considering the zero set of F in 
T(P), in this "dense" case, we are just considering the zero set of F in P|| in the usual way 
via homogenizations. Thus by Main Theorem ^, Canny's original GCP indeed finds a point 
in every irreducible component of Z in P|^, as conjectured in 1990. Of course, the advantage 
of the toric GCP is that we can do the same with greater efficiency for sparse systems with 
small M{E). 

4. Filling 

Here we briefiy recount filling and some related concepts. Some of the material below is 
covered at greater length in [[Roj94|| . The results below form the basis for our combinatorial 



approach to perturbing degenerate polynomial systems. 
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Let S^~^ C M" denote the unit {n — l)-sphere centered at the origin. For any compact 
B C M" and any w G M"', define to be the set of x E B where the inner-product x-w is 
minimized. (Thus B^ is the intersection of B with its supporting hyperplane in the direction 
w.) We then define E"" := {E^, ... ,E^) and DnE"" := {Di n . . . , D„ n E^f ). 

Recall that the dimension of any B CM", dim 5, is the dimension of the smallest sub- 
space of M" containing a translate of B. The following definition is fundamental to our 
development. 

Definition 4. Suppose C := (Ci, . . . , C„) is an n-tuple of polytopes in or an n-tuple 
of finite subsets o/M". We will allow any Ci to be empty and say that a nonempty subset 
J C [n] is essential for C (or C has essential subset J) <^=^ (0) Supp(C) ^ J, (1) 
dim(^jgjCj) = #J — 1, and (2) dim(^^.gj, Cj) > #J' for all nonempty proper J'^J. 

Equivalently, J is essential for C <^==^ the ^ J-dimensional mixed volume of {Cj | j G J) is 
and no smaller subset of J has this property. Figure ^ below shows some simple examples 
of essential subsets for C, for various C in the case n = 2. 



Ci C2 



{1},{2} 






Co 



{1} 



{1,2} 



None 



Figure 3. The essential subsets for 4 different pairs of plane polygons. (The 
segments in the third pair are meant to be parallel.) 



A basic fact about mixed volumes is that A4{E) = <^==^ E has an essential subset, 
whenever Supp(i?) = [n]. However, there is an even deeper connection between filling and 
essentiality: 



Theorem 5. [ |Roj94 , sec. 2.5] Suppose D and E are n-tuples of finite subsets of 17^ such 



that A^(E) > 0. Then D fills E <^ for all w G S""^ , Supp(L' n E"") contains a subset 
essential for E^ . 



Remark 4. One certainly need not check infinitely many w. In fact, we need only check 
one w (just pick any inner normal) for each face of the polytope P = Y17=i Conv{Ei). 

We also recall the following important observation. 

Lemma 1. Let rn:=Y^'^^-^^^Ei. Then there is an absolute constant c>0 such that we can 
decide whether Ai{E) > within 0{rrf) arithmetic steps overQ. Furthermore, if /iA{E) =0, 
then we can find points Pi, . . . ,p„ G (N U {0})" such that J^{{pi} U Ei, . . . , {p„} U En) > 
within the same asymptotic complexity bound. ■ 

The first portion appears in [pK94|| , while the second portion is an elementary corollary. 
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Oddly enough, filling seems to have originated from an algebraic problem: genericity 
conditions for counting the roots of sparse polynomial systems. This aspect is explored 
much further in ||Roj94| , |RW96| , |Roj99|| . We also emphasize that constructing a fill need only 
be done once for a given family of problems, provided E remains fixed. The situation where 
the monomial term structure of a polynomial system remains fixed once and for all, and the 
coefficients may vary many thousands of times, actually occurs frequently in many practical 
contexts such as robot control or computational geometry. 

To conclude our background, we will need the following lemma characterizing irreducible 
fills. 

Lemma 2. Following the preceding notation, assume Ai{D) > 0. Then D is irreducible 
■^=r- for any v lying in some Di, there exists a w E Q" \ {O} such that Df = {v} and 

Proof: First note that the mixed volume condition above is equivalent to {i} being the 
unique essential subset of D^. This follows immediately from definition ^ and, say, the 



development of ||BZ88 

The "<^^" direction then follows almost immediately from theorem ^ If the mixed volume 
condition holds, then the removal of any point from D would indeed violate the filling 
condition from theorem |^. So the removal of any point from D would make Ai{D) decrease. 
The converse implication follows almost as easily: 

Suppose, to derive a contradiction, that D is irrreducible but there is some v in some Di sat- 
isfying the following property: For all w e Q"\{0}, #Df > 2 or . . . , D^^^, DJ^^, . . . , 
D^) =0. Let us then consider the n-tuple D' := {Di, . . . , A-i, Di\{v}, -Dj+i, • • • , -D„). Then 
by theorem |^ once again, D' fills D. But this contradicts the irreducibility of D, so we are 
done. ■ 

5. ToRic Geometry and the Proofs of Our Main Theorems 



m 



Our notation is a slight variation of that used in [[Ful93|] , and is described at greater length 
[Ro.i99|] . However, we will briefiy review a few important facts and definitions. 
The (inner) normal fan of a polytope Q C M", Fan((5), is simply the collection of cones of 



inner normals of faces of Q ||GKZ94|| . (For instance, the inner normal fan of the standard unit 
square in the plane consists of nine cones: the four quadrants, the four nonnegative coordinate 
rays, and the origin.) We will assume the reader to be familiar with the construction of a 
toric variety from a fan, a polytope, or a finite point set |[Ful93| , |GKZ94 . 

Example 2. When A = A (1 Z", it is easy to derive from scratch that T{A) is just the 
projective space PJj^. More generally, if Conv(A) is a product of simplices, then T{A) is a 
product o/ twisted projective spaces ||i''ul93|| — hence our appelation for Chow a. Note also 
that the coefficients of Chow a o^^e multisymmetric functions of {ipA{0}c C ranges over 
the roots of F in T{P). 

Let us now list our cast of main characters: 



Definition 5. ||Ful93| , [Roj99|| Given any wGM", we will use the following notation: 
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T 



V. 



0", 



Div(/) 



'Dq{F,V) 



The algebraic torus (K*)" 

The face of Q with inner normal w 

The closure of the cone generated by the inner normals of Q"^ 

The dual (or angle ) cone {w'gM" | w'-y>0 for ally^a^,}- 

The affine chart ofT{Q) corresponding to all semigroup homorphism^ a^flZ"' 

The point in corresponding to the semigroup homomorphism cr^ fl Z" — > 

mapping p H-i> 6w.pfi, where Sij denotes the Kronecker delta 

The T -orbit of = The T -orbit corresponding to the relative interior of 

The T -invariant Weil divisor ofT{Q) corresponding to a polytope Q' . 

The Weil divisor ofT{Q) defined by a rational function f on (K*)" 

Div(/) -\- Sq{Q') = The toric effective divisor ofT{Q) corresponding to {f,Q') 



{0,1} 



The (nonnegative) cycle in the Chow ring ofT{Q) defined by f]- 
V={Pi,... ,Pk) 



T^Q{fi,Pi), whenever 



We say that P is compatible with Q iff every cone of Fan((5) is a union of cones of 
Fan(P) ||Kho77| , [Ful93| , ^aPH - (So P compatible with Q ^ P has at least as many facets 
as Q.) 




Figure 4. The inner polytope is compatible with the outer polytope. Also, 
the corresponding "outer" toric variety can be obtained as a deformation (or 
image under a proper morphism) of the "inner" toric variety. 

Finally, whenever F is a A; x n polynomial system with support contained in we 
will define the zero se t|3 of F in T{Q) to be the toric cycle Vq{F,V), where V : = 
(Conv(£'i), . . . , Conv(£'fc)). Toric infinity is then defined relative to Q: it is simply the set 

T(g)\(K*r. 

Example 3. (Zero Sets in PJ^) Suppose Q = a -\- (3 A, for any a G and any rational 
P>0. Then T{Q) = canonically. As for explicitly defining the zero set of F in T{Q), 
we can do the following: (1) Define vectors pi, . . . ,Pn&'^^ such that for all i, x^^-fi G K[x] 
is not divisible by any Xj, (2) define fi{x) : = x^x''*/(^, . . . , f^) for all i, where di is the 

^•^Note that the domain and range spaces are respectively semigroups under the natural operations of 
vector addition and field multiplication. 

When necessary, we will also use the underlying scheme structure. 
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total degree of x^^ fi. Then [zi : ■ ■ ■ : Zn ■ Zoo] is a root of F iff fi{z) = ■ ■ ■ = fn{z) = 0. 
In particular, note that this toric definition differs from the classical definition of "zero set 
of F in Pj|/' due to the extra step (1). For instance, our toric definition might omit some 
affine roots, for certain E and F. However, note that step (1) is unnecessary when O&Ei 
for alii. Furthermore, by |[Roj99| , sec. 6.1], the zero scheme of F in K" embeds naturally in 
Vp{F, V) (and Vp{F, V) ) when we replace E by OU E. Hence the introduction of OU E in 
(and A = A n Z" in the proof of) Main Theorem 1. 

The following result will provide some necessary geometric intuition for specializing re- 
sultants. The lemma immediately following then gives a more explicit algebraic analogy 
between the faces of Q and the affine charts of T{Q). 

Vanishing Theorem for Resultants. [[Roj98b|| Suppose F is an {n+l)^n polynomial sys- 
tem (overK) with support contained in E. Then, provided Ai{Ei, . . . , Ei_i, -Ej+i, . . . , -En+i) > 
for some i G [n+1], Res^(F) = Vp{F, V) ^ 0, where V := (Conv(Ei), . . . , Conv(E„+i)) 
and P = ^"+1^ Conv{Ei) . ■ 



Lemma 3. |[Roj99| , sec. 4.2-5.1] Suppose F is a kxn polynomial system overK. with support 



contained in a k-tuple of integral polytopes V := (Pi, . . . , Pk) in M". Assume further that Q 
is a rational polytope in M". Then the defining ideal m ]K[x" | a G fl Z"] of Uw n Vq^F, V) 
is {x^^fi I for all iE [k] and biEZ"^ such that h + Pi C a'^). ■ 

Lifting (or projecting) from one toric variety to another is an important fundamental ideal 
we will also use. The following lemma follows directly from the development of ||l''ul93| . 



Lemma 4. Suppose QcM" is an n- dimensional rational polytope, and B is either a nonempty 
finite subset of 11^ or a rational polytope in M". Assume further that Q is compatible with 
Conv(i?). Then there is a natural (surjective) proper morphism tt : T{Q) 'T{B). In par- 
ticular, t[{Vq{F,V))=Vb{F,V) , where the latter cycle is the image ofVconv(B){F','P) under 
the natural proper morphism from T(Conv(i?)) to T{B). Furthermore, 71(0^) = 0^], where 
the corresponding T-orbits are considered in their respective domains, and 7r|(K.)n=id. ■ 

Remark 5. Following the notation of Main Theorem 1, it easily follows that if A = A (1 
then the multiplicity of any root of F in (K*)" is preserved under the map (pA- If A = dQr\7/^ 
for some rational polytope compatible with P, and c/ G N zs sufficiently large, then the same 
will be true of any root of F in T{P) |[Ful93|] . In general, thanks to the functoriality of Chow 
forms |PS95|] , Chow^i is precisely the Chow form of the subscheme ipA{Z) o/P*"^"^. 



Another immediate corollary of our last lemma is the following result on the meaning of 
the projective coordinates [C" | aEA\. 

Corollary 4. Following the notation of Main Theorem 4, let (eT{P) be an isolated root of 
F and fix a vertex of y eCouv{A) with inner normal w. Then ^Pa{0 Hgs in the affine chart 
Uw of T{A) ■^=^ the coefficient of in the corresponding factor of Pert^ is nonzero. ■ 



SOLVING DEGENERATE SPARSE POLYNOMIAL SYSTEMS FASTER 



19 



Example 4. Suppose we take A = A fl Z" as usual. Then T{A) = canonically, and 
there are exactly n + 1 affine charts of T{A) corresponding to vertices. These charts are 
respectively isomorphic to minus the hyperplane at infinity, and P||\{xj = 0} as i runs 
through [n]. For example, given a factor o/Pert^ such as Uq+u^, we know that it corresponds 
to a root image <^a(C) which lies in two of these affine charts and outside of n — 2 others, 
i. e., (Pa{C) = [0:0:1:0: ■ ■ ■ : : 1] lies on the x^-axis. Similarly, if all the coordinates of (Pa{C) 
are nonzero, then (, fAiC) ^ (K*)". 



Finally, we will need a version of the fundamental fact that F generically has exactly 



A4{E) roots in (K*)". The case ]K = C first appeared in ||Ber75|l , and the general case is an 



immediate corollary of [|Ro,i99|, Main Theorem 2]. 



Lemma 5. Let Ce be the vector of coefficients of F and define jj^E := ^11=1 i^-^i- Then 
there is an algebraic hypersurface S^; C K*'^ such that C e W^^\'Ee =^ F has no roots 
in T(P)\(K*)"'. Moreoever, the latter assertion implies that F has exactly M.{E) roots, 
counting multiplicities, in (K*)". ■ 



With all our technical background complete, we can now prove our main theorems. 



5.1. Polynomial Algebra and the First Half of Main Theorem 1. 

Our proof of assertions (0)-(2) of Main Theorem 1 will rely on two main constructions: the 
toric perturbation PertAnZ" and an extension of Canny's constructive version ||(Jan88| | of the 



primitive element theorem. We thus emphasize that while Chow^ and Pert^ permit one to 
reduce polynomial system solving to multivariate factorization, we will not use factoring to 
build h and hi, . . . , hn- 

Algebraically, the idea is as follows: Our techniques allow us to find a set of points Z' C 
(K*)" intersecting every irreducible component of the zero set of F in (K*)". Consider the 
field extension L:= K{Z'), obtained by adjoining all the coordinates of all the points of Z'. 
Then L is a finite extension of K, and by the primitive element theorem |[Van50|| , L = K{6) for 



some Furthermore, by the same theorem, we should be able to recover the coordinates 

of every point in Z' in terms of rational functions (with coefficients in K) of 6. Since 
K{6) = K[t]/h{t) when h is the minimal polynomial of 6 over K, we can further simplify the 
preceding rational representation to one in terms of polynomials in 6 with coefficients in K. 
Our algorithm for Main Theorem |1] will explicitly construct this encoding for us. 

To describe our algorithm, we will first need a bit of subresultant theory: For any univariate 
polynomials f(t) = ao + + ■ ■ ■ + cudif^^ and g(t) = Po + Pit + ■ ■ ■ + ^(12^^^, consider the 
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with (ii — 1 "/3 rows" and (i2 — 1 "« rows." Let Ml (resp. Mq) be the submatrix obtained 
by deleting the last (resp. second to last) column, and let 7^j(/, 5^) : = det(M/) for iG{0,l}. 
Finally, define the first subresultant of / and g to be Tlo{f,g) + Tli{f,g)t. It is then a 



classical fact that if gcd{f,g) = a + bt with 6 7^ 0, then 
heavy use of this fact in our proof. 



noif,9) 



We will make 



Recall also the following algorithmic facts about polynomials over any field pP94]| : 

a. Given the values of a univariate polynomial of degree d a.t d + 1 distinct points, the 
coefficients of the polynomial can be recovered within 0*{d) field operations. 

b. The gcd of two univariate polynomials of degree 0{d) can be found within 0*{d) field 
operations. 

c. The coefficients of the square-free part of a univariate polynomial (of degree d) can be 
found within 0*{d) field operations. 

d. The subresultant of two univariate polynomials of degree 0{d) can be computed within 
0*{d) additions and multiplications. 

We now proceed with our proof of the first half of Main Theorem 
Proof of Assertions (0)— (2): To simplify matters slightly, we will first derive a Las Vegas 
version of our algorithm for Main Theorem |l|. The announced time bound will then follow 
from a simple derandomization. The construction of h,hi, . . . ,hn will follow from evaluat- 
ing PertAiu) at various specializations of u, thus reducing to 0{n) univariate polynomial 
interpolation and gcd problems. In particular, our algorithm can be outlined as follows: 

Step Set y4 = A n Z" and fix generic values in K for ui, . . . , Un- 
Step 
Step 



Define /iG]K[t] to be Feit A{t, Ui, . . . , Un)- 

If n = 1, set hi{9) ■.= 9 and stop. Otherwise, for all « G [n], let q~(t) be the square-free 
part of PertA(t,Mi, . . . ,Ui-i,Ui - ... ,m„). 

Step 3 Let a satisfy either a = l or a{a + 1) = 1 according as charK 7^ 2 or charK = 2. Then 
define q'*(i) to be the square-free part of Pert^(t, Ui, . . . , Mj-i, Ui + a, Wj+i, . . . , Un) for 
all 2 G [n] . 

Step 4 For all i G [n] and j G {0, 1}, let rij{9) be the reduction of Tlj{q^{t), q*{{a + 1)6' — at)) 
modulo h{9). 

Step 5 For all iG [n], define hi{9) to be the reduction of —9 — ^r^^ modulo h{9). 
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Note that assertion (0) thus follows immediately from Steps 1 and 4, thanks to the beginning 
of Main Theorem ^. Let us now verify the correctness of our algorithm, clarifying the 
genericity assumption of Step along the way. 

Using Main Theorem 4 once more, we know that the factors of Pert^ define for us a 
set of points Z = {C^-'^jelN], with < Ai{E), such that Z intersects every irreducible 
component of </)^(Z). In particular, we see that the roots of h are exactly {O^-'^jeN', where 

:= - Er=i CF^w^, Z' := {(^^^}jeN' = ZnK^, and C^^'^ = (Cp\ • • • , d'^) for all j G N'. 
Furthermore, it is easy to check that for all but finitely many [ui : ■ ■ ■ : u„] , j ^ j' =^ 

case, via remark ^ in section |^, the multiplicity of any isolated root 
(^^■'^ G (K*)" of F is exactly the multiplicity of the root ^^■'^ of h.) Similarly, for any i G [n], 
J ^ f =^ eU) + ^ ^0') + (P and e^^^ - aCF^ ^^'"^ - c^CP , for all but finitely many 
[ui : ■ ■ ■ : Un]- The avoidance of these 1 + 2n finite sets of [mi : ■ ■ ■ : m„] is precisely our 
genericity condition for Step 0. Furthermore, by checking square-free parts, we can check 
our genericity condition with negligible overhead (via fact (c)). 

Now note that if 6 = 6^^^ for some j, then for all zG [n], q^(t) = q*({a + 1)^ — at) = <^==^ 
t = ^(-?) + (^p^ Furthermore, by construction, this common root has multiplicity 1 for both 
and q*. It is then easily checked that hi{9''^'>) = c!f \ 

Recalling that the zero scheme of F in {Wf is exactly Va{F, V) n (K*)" | |Roj99| , sec. 5.1], 
we have thus proved assertions (1) and (2) of Main Theorem |I]. ■ 

Remark 6. The probability of failure in our Las Vegas algorithm above is 0, assuming any 
probability distribution on the coefficients of F yielding probability 1 avoidance of algebraic 
hypersurfaces m K*"^""^""^'^' 

5.2. Concluding the Proofs of Main Theorem 1 and Corollary |T]. 

We begin by checking the complexity of our Las Vegas algorithm from the preceding section. 
First note that by Main Theorem ^, each evaluation of Pert^ (for constant Mo,Mi, . . . ,Mn) 
takes 0*{nR{EfS{Ef-^'^^) arithmetic steps over K. So by observation (a) above (and asser- 
tion (0)), we can find h via interpolation within time 0*{nM.{E)R{EYS{Ey''^'^^). Similarly, 
by (a), (b), and (c), we can find each q~ and q* within the same time bound. So the construc- 
tion of all these polynomials thus takes a total of 0*{n'^M{E)R{E)'^S{E)'^-^'^^) arithmetic 
steps over K. 

Finding the coefficients of 5'*((« + 1)^ — at) takes time 0*{A4{E)'^) via another simple 
interpolation step. So by (d), we can then find hi, . . . , /i„ still within the latter asymptotic 
time bound. As for space, we only need to keep track of 0{nJ^{E)'^) coefficient values, and 
this falls well within the 0{nS{Ef) space requirement of Main Theorem H. 

To conclude, we need only derandomize our algorithm. This can be done as follows: replace 
the generic selection of mi, . . . , m„ above by Ui = for i G [n]. We then obtain that at our 
genericity condition is violated iff the point (!,£,... , e") G K"-"*"^ lies in at least one of (2n-|- 1) 



hyperplanes depending on the input F. From the box principle, and the well-known 



properties of the Van der Monde matrix | BP94 ], this can happen to at most n{2n + ly-^^^') 
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distinct values of e. So we can derandomize by repeatedly running steps (l)-(3) with new e 
at most n{2n + times, thus finally accounting for our aforementioned deterministic 

time bound. 

Moving on, we must now further refine our algorithm so that our arithmetic is over K (or 
a small algebraic extension thereof) instead of K. This can be done as follows: If charK = 0, 
then there are enough choices for e in to derandomize our algorithm (since K will be 
infinite). Otherwise, we simply choose e in an algebraic extension of K of degree 
[logp((n + l)^A^(-E')^)] , so that we have more than enough e to choose from. Assertion (3) 
is now proved. 

To conclude, it immediately follows from the development of ||Roj99| , sec. 6.1] that the zero 



scheme of F in K"- embeds naturally in T)p{F, V) (and Vp{F, V)) if we replace E by OU E. 
So this introduction of extra points into our supports indeed guarantees that Z' includes all 
the affine roots of F. ■ 

Remark 7. We have thus improved the complexity of finding all the afRne roots from 
(roughly) polynomial in Yldi to polynomial in Ai{0 U E) . However, one can improve this 
even further to polynomial in SA4{E) — the stable mixed volume ||HS97| , |Roj98c|| of E. (In 



particular, SA4{E)<M.{OLi E)<Y[di and the gaps between can be quite large (cf. example 
^.) To make this final improvement, it is necessary to use a more refined resultant operator 
— the afRne toric resultant, denoted AffResg{F) |[Roj97b|| . This is covered at greater length 
|Roj97b| , [Roj98c|| , and this new operator also allows us to extend Corollaries || and ^ to 



m 



K" minus an arbitrary union of coordinate hyperplanes. 

Proof of Corollary |1]: It follows immediately from our proof of Main Theorem 1 that the 
fields K[Ci I (Ci, . . . , Cn) e (K*)" is a root of F] and K[9 \ h{9) = 0, H 0] are identical 
when Ml, . . . ,Un are chosen from K. (So the assumption that char]K = is actually stronger 
than necessary.) Since the latter field is exactly the splitting field of g, we are done. ■ 

5.3. The Proof of Main Theorem 2. 

We first note that the well known results on the degree of Res£.(/i, . . . , fn+i) with respect 
to the coefficients of various fi ||Stu94|| remain true over any algebraically closed field. This 



follows easily from the formulation of the resultant for a collection of invertible sheafs on a 
projective variety ||GKZ94|| . In particular, Chow^ should indeed be either be identically zero 



or a homogeneous polynomial (in the Ua) of degree A4{E). 

To prove assertions (l)-(3), we can then simply invoke the Vanishing Theorem for Re- 
sultants and lemma ^ (since P is compatible with Conv(A)). For instance, we obtain that 
(fA{Z) is positive-dimensional iff Chow^ has infinitely many distinct divisors of the form 
XlaGA^aWa- So asscrtiou (1) follows immediately. Assertions (2) and (3) follow similarly. ■ 

5.4. The Proof of Corollary |. 

Let u (< 2.376) denote the famous matrix multiplication exponent ||CW90|| and set =A. 



It then follows immediately from [ !JW90|| and |[EC95| , The Division Method] that for any 



choice of constant coefficients in K, Res£'(F) can be evaluated within 0*{nR{E)S{EY) 
arithmetic operations over K, using 0{nS{EY) space. 
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The first part of Corollary then follows immediately from a Van der Monde type ar- 
gument, as in the proof of Main Theorem |1]. In particular, via interpolation, it suffices to 
evaluate Chow a{u) at exactly 1 + nAi{E) distinct points of the form {l,e, . . . to see if 
ChoWyi is identically zero. 

To then count the roots of F in (K*)*^ when Chow^i is not identicaly zero, we can begin 
with a variant of the algorithm from Main Theorem |I| where we evaluate ChowAnz™ instead 
of PertAnZ"- From our previous observations, we can thus construct h and hi, . . . ,hn within 
time 0*{n'^M{EYR{E)S{E)'^) and space 0{nS{E)'^). 

We then use the following trick: Compute the gcd, g, of h and YYi=i ^i- ^y remark ^ 
of section |^, we immediately obtain that deg h — deg g is exactly the number of roots of F 
in (K*)" counting multiplicities. (In fact, the roots of g tell us precisely which (^^^ lie out 
of (K*)".) By the same argument, we can also count the number of distinct roots simply 



by replacing h with its square-free part. By facts (b) and (c) of section [ST^, and since the 
degree of HILi most nAi{E), these computations cause a negligible growth in our 

asymptotic complexity bounds. So we are done. ■ 

5.5. Facet Searches and the Proof of Main Theorem 3. 

The first portion of this result follows immediately from lemma ^ and [ [Roj94| , Corollary 3] . 
The second portion is a consequence of the following algorithm: 

Step 1 Compute the facet normals of P and the vertices of all the Conv(ii^j). 

Step 2 Find a vertex v of some Ei such that for any facet normal w of P, Ef =^ [H^Ef > 2 

or Ai{Ei, . . . , Ei^i, Ei+i, . . . , En) = 0]. If no such pair v exists, stop. Otherwise, delete 

any point of E"^ and go back to step 1. 

By lemma ^ the above algorithm will eventually stop with an irreducible fill of E. As 
for its complexity, note that the number of facets of P is (9(m^"), and we can find the 



normals to these facets within that many arithmetic steps over Q ||CS93|| , given the convex 
hulls of the Ei. Furthermore, this asymptotic bound dominates the complexity of finding 
the convex hulls of all the Ei ||PS85|| . So the complexity of Step (1) is 0{m'^^). Step (2) thus 
amounts to (n + l)(9(m^") checks for zero mixed volume per vertex. So by lemma |l|, this 
takes (9(?T,m^"+^) arithmetic steps over Q. These steps will be executed at most m times, so 
we are done. ■ 

5.6. Algebraic Homotopies and the Proof of Main Theorem 4. 

Main Theorem |^ is the cornerstone of our approach to solving degenerate systems of equa- 
tions, so we will precede its proof by illustrating one of its underlying constructions: explicit 
algebraic deformation of degenerate zero sets. 

More precisely, following the notation of Main Theorem |, we will construct a family 
of curves C, fibered over the projective line, whose fiber over a particular point is a zero- 
dimensional variety Z^Z encoding the multiplicities of all the irreducible components of Z. 
To do this, we begin with the following lemma, which follows easily from the development 
of ||Ro.i99| , sec. 5.1] and |F\Il84i sec. 11.3]. 
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Lemma 6. Following the notation of definition ^ and Main Theorem ^ let Zq be the zero- 
dimensional part of Z. Also let Z^ he the zero scheme of F — sF* in T{P) x Pj^. Then 
Z = Z^ n {T{P) X {0}). Finally, let C he the algehraic curve (possihly reducihle) defined 
hy the union of all one- dimensional components of Z^ with surjective projection onto the 
second factor of T{P) x Pjjj. Then C has the following properties: 

1. n (T(P) X {so})=Cn (T(P) X {so}) for almost all s^ef]^. 

2. Z ■=C\~\{T{P) X {0}) is a suhscheme of Z consisting of exactly A4{E) points (counting 
multiplicities). Furthermore, Zq is a suhscheme of Z . 

3. Let W he any irreducihle component of Z. Then Z has at least one point in W and, 
for a generic choice of F* , the numher of points of Z in W (counting multiplicities) is 
exactly the cycle class degree ofW. ■ 

We can now begin our most important proof. 
Proof of Main Theorem |^: Similar to the beginning of the proof of Main Theorem H, the 
resuhs of [ Stu94 | (generalized to arbitrary algebraically closed K) immediately imply that the 



degree of H as a polynomial in s should be XliLi -^(-^i) • • • ; -^i-i; Ei+i, ... ,En,A) <R{E). 
Also each coefficient of Ti.{s) should be a homogeneous polynomial (in the Ua) of degree 
A4{E). These two assertions of course include the opening statement of Main Theorem ^ 
(on the degree and homogeneity of Pert^), but they will follow only upon showing that Ti. 
is not identically zero. 

To see this, note that lemma ^ and the Vanishing Theorem for Resultants readily imply 
that the coefficient of the highest power of s in 7i is precisely 'Res(E,A){F* , fn+i)- (Simply 
check the zero set of F — sF* in T(P) at s = oo, via the homogenization s'F — sF*.) By 
definition ^ and the Vanishing Theorem once more, we see that this polynomial in the Ua is 
not identically zero. SoH^O and we've finished the simplest part of our proof. 

Part (1) of Main Theorem ^ follows similarly: One need only consider the unspecialized 
resultant polynomial Res(E,A){F, fn+i) and observe the terms of degree in s as we speciaUze 
coefficients to obtain F — sF*. In particular. Chow a{u) is precisely TC{u; 0). Note then that 
(2) and (3) also follow almost immediately, provided Chow^ is not identically zero. 

To properly handle the cases of (2) and (3) where we are actually working with a non-trivial 
toric perturbation, we now invoke lemmata ^ and ^ to establish a precise correspondence 
between the factors of PertA and the points of Z. 

Letting Z^ ^ be the zero set of TC{u; s) in P*"^ ""^ x Pj^, note that if k is the least exponent 

of s in Ti, then Z^_^_ and the zero set of ^ in P^^""*^ x differ only by the presence 

of the hyperplane P*"^"^ x {0}. The second zero set does not contain this hyperplane, 
so let's call the second zero set Z^. By lemmata |] and and the Vanishing Theorem for 
Resultants, we then derive that dim[Z^ fl (T(P) x {so})] = implies the following equivalence: 
TC{H^^(^(^y, So) =0 <^=^ (^Z^ n (T{P) x {sq}), where Hp is the hyperplane dual to the point 
p.0 By assertion (1) of lemma ||, dim[Z^ fl (T(P) x {sq})] = for almost aU sq G P^. 
So (Pa{C) is an open subset of Z^, where we define (Pa{C) := {(y, Sq) | y G H^^(^q ; ( G 
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Soifp:=[pa\aeA]eF*^~^ then {[yj aG A] eP^^-^ | EaeAPaya = 0}. 
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C n (T(P) X {so}) ; SqGP]^}. Therefore, since is a proper map, ^ must vanish on all of 
({>a{C). In particular, via remark |^ of section |^, 

Pert^(M)=a- JJ^ 

Cecn(r(P)x{o}) 

where aGlK*, Ylc^^a I Oj^^'- = Va{.C)) cind the product counts intersection multiplicities. 

Continuing our main proof, assertions (2) and (3) follow immediately from our last formula 
and our preceding observations. As for the complexity bounds, these follow immediately from 
our earlier fact (a) and the Division Method |[EC95|| to compute Res*(-): to evaluate Veit a{u), 
we simply find the coefficients of 7i(u; s) by evaluating 1-L{u\ s) at R{E) + 1 distinct values 
of s and then interpolating. ■ 

Note that our algebraic proof avoids the use of limiting arguments that were present in 
Uan90|| . Thus our result holds for any algebraically closed IK, instead of just C. 



5.7. Double Perturbations and the Proof of Corollary 

The first portion of our final corollary follows immediately (thanks to Main Theorem |) 
by simply replacing Chow^ with Pert^ in the algorithm from the proof of Corollary |^. In 
particular, we obtain that the exact number of roots of F in (K*)" (counting multiplicities) 
is exactly deg/i* — degf?*, where h* (resp. g*) is the corresponding variant of h (resp. g), 
using the notation of the proof of Corollary |[ The number of distinct roots can of course 
be recovered by using square- free parts (as before), thanks to remark ^ of section |. Also, 
by Main Theorem ^, the complexity of this algorithm is just the complexity estimate from 
Corollary ^ multiplied by R{E). 

As for the second portion of our corollary, we make a slightly more sophisticated variant 
of the preceding replacement of Chow a. 

Definition 6. Let F* and F** be nxn polynomial systems with support contained in E 
such that (1) F* and F** each have only finitely many roots in T{P), and (2) F* and F** 
share no common roots. Following the notation of Main Theorem 0, define a double toric 
perturbation of F, Pert^*, to be the greatest common divisor of Pert a,f* o^nd Pert^.F**- 

It is then clear (via Main Theorem ^ once again) that using Pert^* in place of Pert^ in 
our preceding algorithm will lead to a new estimate, deg/i** — deg^f**, for deg(Zo fl (K*)"). 
Furthermore, by the above definition, it is clear that deg/i** — degg** < degh* — degg*. 

As for estimating deg2^o and degZoo, our preceding theory tells us that we can simply 
respectively use degh** and M{E) — degh**. ■ 

Remark 8. Our algorithm thus requires a generic choice of F* and F** . Just as in the 
construction of Pert^, we can derandomize via combinatorial means: We simply use an 
irreducible fill (as in Main Theorem^ to construct F* , and then simply perturb a single 
coefficient of F* to construct F** . This is the trick used in our earlier example in section \3. 4 
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Remark 9. The basic idea behind the double perturbation is that the points in Z := {'~^{6)}^q'^=q 
lying in positWe- dimensional components of ipA{2) will move as we vary F* . Thus, assum- 
ing that F** is such that the new Z overlaps the old Z only on the isolated roots of F , we 
should be able to pick out these isolated roots by computing the gcd o/Pert^.F* and Pert^i^i?... 
We hope to address this "motion of points within a deformation" in future work. 



6. Computing Toric Resultants and the Complexity of the Sparse Encoding 

Let us first recall some important facts on the computation of toric resultants. 

As of 1998, the main method for computing Res£.(F) is to first construct an S{E) x S{E) 
toric resultant matrix, M^, whose nonzero entries are certain coefficients of F . This 
matrix is specifically built so that det(M^) is, for generic choices of the coefficients Ci^ai a 
nonzero multiple of Res^(F). 

Remark 10. So S{E) is actually a parameter depending on which algorithm we use for 
constructing — hence our earlier use of an asymptotic bound, instead of an explicit 
formula, for S{E). The aforementioned bound is actually a simple estimate on the number 
of lattice points in the interior of the shifted Minkowski sum 6 + Yl^=i where 5 G Q" is 
chosen generically. The derivation follows easily from Stirling's estimate for the Gamma 
function, the n-dimensional identity M.{P, ... , -P) =n!Vol(P), and the multilinearity of the 
mixed volume. 



Via some clever interpolation tricks ||CE95| , pi)C95| , [EP97|] , one can recover the exact value of 



Res^(F) after interpolating det(M^) through several-many specializations of the coefficients 
of F. One such fundamental technique, which uses n + 1 versions of M^, is known as the 
Division Method ||Can87| , [EC95|| . In general, the matrix is highly structured (it is 



quasi- Toeplitz |[EP97|| ) and, when charK = 0, this permits Res^(F) to be computed much 



faster than would be expected. In practice, the cost of building (or several versions 
thereof) can be amortized when one works with many F with support contained in the same 
E. Furthermore, when randomization is allowed, the results of ||CE95| , pi)C95| , |EP97|| tell us 
that this preprocessing is actually negligible. 

As for the complexity of computing Res^(F) itself, we state the following additional facts: 

I. |EC95| , The GCD Method] When charK = 0, we can compute 'Res^{F) (for any choice 
of constant coefficients in K for F) within 0*{S{EY^^) arithmetic steps and 0{S{EY) 
space. Q However, we have the added benefit that we can also compute 1-L{u\ s) (for 
any constant mGK*"^) within the same complexity bound. 
II. IIEP97II If we assume charK = and allow randomization, then we can accelerate the Di- 



vision Method (resp. GCD Method) to obtain a Las Vegas time bound of O* {n"^ R{E) S (E) 
(resp. 0*{nS{EY)). Furthermore, either of these improvements requires only 0*{nS{E)) 
space. 



2^ 



The restriction on charK is due to a use of effective Hilbert irreducibility, which actually fails in positive 
characteristic [ Lan83 1 . 
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III. If KesgiF) =det(M^), then we can reduce the deterministic time bounds of the Division 
and GCD methods to 0{R{E)^), regardless of charK. Furthermore, if we also allow 
randomization and assume charK = 0, then we can further improve the time bounds of 
(I) and (II) to 0*{R{E)^). However, characterizing when Res^(F) can be expressed 
as a "small" determinant is an open problem. (See ||WZ94| | for some interesting par- 



tial results, including some cases where the Newton polytopes are products of scaled 
standard simplices.) 



The last fact is actually a simple corollary of the development of ||EP97|| . In particular, in 
the situation of (III), we can skip an interpolation procedure that would have multiplied our 
time bound by 0*{R{E)). 

Let us now state and prove the best current speed-ups for all our preceding algorithmic 
results. 

Corollary 5. Suppose charK = and we allow randomization in our algorithms. Then our 
main algorithmic results can be sped up as follows: 

Sequential (Las Vegas) Time Bound = 0*{- ■ ■) 



Main Theorem 1 
Corollary^ (First Bound) 
Corollary^ (Second Bound) 
Main Theorem 4 



n^M{E)R{EfS{Ef or n'^M{E)S{Ef 
n'^M{E)R{E)S{Ef ornM{E)S{Ef 
n^M{E)R{E)S{Ef or n'^M{E)S{Ef 
n'^R{EfS{Ef ornS{Ef 

Furthermore, the space hound for each of the above algorithms is 0*{nS{E)). Finally, if we 
also have that Res^(-F) = det(M^), then the four pairs of entries in the right-hand column 
(from top to bottom) can be replaced by the following sequence: nM.{E)R{E)^ , A4{E)R{Ey , 
nM{E)R{Ef, M{E)R{Ef. ■ 

Remark 11. As before, the probability of failure in our Las Vegas algorithm above is 0, as- 
suming any probability distribution on the coefficients of F yielding probability 1 avoidance of 
algebraic hypersurfaces in ]K;#monomiai terms ^ ji^g total number of random choices of elements 
in K (or a small algebraic extension thereof) needed is n + R{E). (This is just the number 
choices needed to construct h and a variant ||EP97|| of Mg.) 



So in summary, we can solve any n x n system, over an algebraically closed field of 
characteristic zero, in Las Vegas time near-quartic in the number of roots of a closely related 
system.Q Furthermore, we can go even faster when we have a sufficiently compressed toric 
resultant matrix. Before proving the above corollary, we will briefly explain what we mean 
by a "closely related system." 

First recall that A4{E) is precisely the cycle class degree of the toric divisor Vp[F,V) 
Ful93| , p^o,i99|] . Put more simply, if we simply perturb the coefficients of F, we can ex- 



pect F to have exactly M.{E) roots in T{P). Thus, the quantity A4^p^ defined earlier 



We conjecture that this can be done in positive characteristic as well. The main current obstruction 
is the use (in current fast algorithms) of algebraic identities for recovering elementary symmetric functions 
from power sums, which fail for small positive characteristic. 
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can be reinterpreted as follows: it is the average number of roots of an n x n system of 
equations with support contained in {Si, . . . ,Sn), as we let the £i independently range over 
{£^1, . . . , En+i}, and we assume generically chosen coefficients. So the quantity S{E) can 
also be interpreted as a weighted average of a set of cycle class degrees. Similarly, note that 
the generic number of roots of the (n + 1) x (n + 1) system {F — sF*, s — sq) is exactly 
A4{Ei X {0, 1}, . . . ,En X {0, 1}, {0, Cn+i})- So by the multilinearity of the mixed volume, 
the last mixed volume is exactly R{E). 

Let us now prove our above corollary. 
Proof of Corollary ^: The key bounds to begin with are those of Corollary^. In particular, 
the first bound of Corollary § is the complexity of determining whether Chow a{u) vanishes 
identically. Since this can be accomplished by evaluating Res^(F) at A4{E) + 1 random 
points, facts (I)-(III) above immediately imply our asserted bounds. 

As for the second bound of Corollary |], this is the complexity of running a variant of the 
algorithm of Main Theorem 1, where Pert^ is replaced by Chowy^. Since Chowyi is just a 
specialized resulant, and since this algorithm boils down to evaluating Chow^ at 0{nA4{E)) 
distinct points, facts (I)-(III) immediately imply these bounds as well. 

From the proof of Main Theorem 4, we know that the bound from Main Theorem 4 is 
simply the complexity of evaluating Res^iF) at < R{E) + 1 different specializations of s. 
(Remember that s occurs only in the coefficients of /i, . . . , /„, and all other parameters are 
assumed to be constants.) So this bound follows easily from facts (I)-(III) as well. 

To conclude, the bound from Main Theorem 1 is simply the complexity of evaluating Pert^i 
at 0{nAi{E)) distinct points. From the bound of Main Theorem 4, we are done. ■ 

Are the above complexity bounds the best one can expect for solving polynomial systems 
specified in the sparse encoding?^ Neglecting the precise values of the exponents (which 
we've seen range somewhere between 4 and 7.376, if not better), the answer is "yes." This 
is due to the fact that a generic F will have exactly A4{E) distinct roots in K, regardless 
of the number of terms present. Thus, it is really A4{E), not the number of terms, which 
governs the complexity of global polynomial system solving over an algebraically closed 
field. So the quantities in the "base" of our bounds can not be any smaller (asymptotically) 
than A4{E). As for the exponent, we so far have only the obvious worst case lower bound 
of 1. 

However, the question of whether the number of terms more strongly governs the com- 
plexity of solving over a non-algebraically closed field, or solving for a single root, is quite 
open for n >2. For example, while Khovanskii has shown that the number of real roots 
of a sparse system of equations is singly exponential in the number of terms ||Kho91|| , the 



complexity of real solving is not yet known to fall within such a bound if n > 2. Similarly, 
while a recent algorithm of Ye ||Ye94]| for ^-approximating a single d— root of a G M has 



complexity 0{{\ogd) log log ^), the complexity of finding a single root of F in K" is quite 
open. We hope to address these finer points of sparse algebraic complexity in future work. 



That is, when we specify polynomial systems as a list of exponents and coefficients. 
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We now close with a brief example of how Ai (E) can be smaller than du (the product of 
the total degrees of /i, . . . , /„) by an exponential factor. 

Example 5. (Well Directed Spikes) Consider the system of equations F defined by 
ai,i + ai,2Xi H h ai,„a;„_i + Ci,i(a;i • ■ ■ a;„) H h Ci^d{xi ■ ■ ■ = 



an,l + an,2Xl H h a„,„X„_i + Cn,l{Xi ■ ■ ■ X„) H h Cn,d{Xl ■ ■ ■ XnY = 0- 

In this case, the Newton polytopes are all equal to a single "spike, " and this spike is equivalent 
(via an integer linear map with determinant 1) to a standard n-simplex scaled by d in the 
Xi-direction. So it is easy to check that M.{E) = d. However, the product of the total degrees 
of F is clearly n^d^ . (It is also not hard to see that the best multigraded Bezout bound 
|Wam92|| is nld"'.) Generating infinite families of such examples is easy, simply by picking 
Newton polytopes which are n- dimensional, but "long" in a suitable fixed direction. 

Remark 12. The construction oftoric resultant matrices is an area of active research and it 
can be reasonably expected that our earlier asymptotic estimate on S{E) will be significantly 
improved in the near future. In particular, a significant first step would be to find an algorithm 
which always constructs a toric resultant matrix of size 0{R{E)). Looking even further 
ahead, there is also hope for general algorithms which construct even smaller matrices, via 
the use of entries which are nonlinear polynomials in the coefficients of F. 
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